When stakes are high: Balancing accuracy and transparency with Model-Agnostic Interpretable Data-driven suRRogates

Authors

Abstract

Technological advancements make it possible to develop high-performance black box predictive models. However, strictly regulated industries (like banking and insurance) ask for transparent decision-making algorithms. We therefore present a procedure to develop a Model-Agnostic Interpretable Data-driven suRRogate (maidrr) suited for structured tabular data. Knowledge is extracted from a black box via partial dependence effects. These are used to perform smart feature engineering by grouping variable values. This results in a segmentation of the feature space with automatic variable selection. A transparent generalized linear model (GLM) is fit to the features in categorical format and their relevant interactions. This GLM serves as a global surrogate for the original black box and replaces it in production. We demonstrate our R package maidrr with a case study on general insurance claim frequency modeling for six publicly available datasets. Our maidrr GLM closely approximates a gradient boosting machine (GBM) black box and outperforms both a linear and a tree surrogate as benchmarks.

• Procedure to develop an interpretable surrogate for a complex system.
• Surrogate outperforms benchmarks regarding accuracy and fidelity.
• Automatic feature selection, segmentation and local explanations.
• Satisfies the transparency needs of a regulated industry or a high-stakes decision.
• Case study on claim frequency prediction with public datasets.
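The idea of extracting partial dependence effects from a black box and grouping variable values can be illustrated with a minimal Python sketch. It is not the maidrr package or its API: the function names, the toy black_box model, the data and the tolerance-based merging rule are all assumptions made for illustration, and the simple adjacent-value merge stands in for whatever grouping procedure the authors actually use.

```python
from statistics import mean


def partial_dependence(model, rows, feature, grid):
    """Average prediction when `feature` is forced to each value in `grid`."""
    effects = []
    for value in grid:
        # Overwrite the feature in every row, keep all other features as observed.
        preds = [model({**row, feature: value}) for row in rows]
        effects.append(mean(preds))
    return effects


def group_by_effect(grid, effects, tol):
    """Merge adjacent grid values whose effect stays within `tol` of the
    first value in the current group (an illustrative rule, not maidrr's)."""
    groups = [[grid[0]]]
    anchor = effects[0]
    for value, effect in zip(grid[1:], effects[1:]):
        if abs(effect - anchor) < tol:
            groups[-1].append(value)
        else:
            groups.append([value])
            anchor = effect
    return groups


def black_box(row):
    # Hypothetical stand-in for a fitted GBM predicting claim frequency.
    if row["age"] < 25:
        base = 0.20
    elif row["age"] < 60:
        base = 0.10
    else:
        base = 0.15
    return base + 0.001 * row["power"]


# Toy portfolio, invented for this example.
rows = [
    {"age": 30, "power": 50},
    {"age": 45, "power": 100},
    {"age": 70, "power": 80},
]
age_grid = [18, 20, 24, 25, 40, 59, 60, 75]

pd_age = partial_dependence(black_box, rows, "age", age_grid)
age_groups = group_by_effect(age_grid, pd_age, tol=0.01)
print(age_groups)
```

Each resulting group would then become one level of a categorical feature, and a GLM fit on such features would play the role of the global surrogate described above.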

Similar resources

MAGIX: Model Agnostic Globally Interpretable Explanations

Explaining the behavior of a black box machine learning model at the instance level is useful for building trust. However, what is also important is understanding how the model behaves globally. Such an understanding provides insight into both the data on which the model was trained and the generalization power of the rules it learned. We present here an approach that learns rules to explain gl...

Accuracy in Detecting High Stakes

Author(s): Clea Wright Whelan ; Graham Wagstaff ; Jacqueline M Wheatcroft Title: High stakes lies: Police and non-police accuracy in detecting deception Date: 2015. Appeared online 26 June 2014 Originally published in: Psychology, Crime and Law Example citation: Wright Whelan, C., Wagstaff, G., & Wheatcroft, J. M. (2015). High stakes lies: Police and non-police accuracy in detecting deception. ...

Crucial conversations: tools for talking when the stakes are high.

Local Interpretable Model-Agnostic Explanations for Music Content Analysis

The interpretability of a machine learning model is essential for gaining insight into model behaviour. While some machine learning models (e.g., decision trees) are transparent, the majority of models used today are still black-boxes. Recent work in machine learning aims to analyse these models by explaining the basis of their decisions. In this work, we extend one such technique, called local...

Making clinical decisions when the stakes are high and the evidence unclear.

Dylan, a 20 month old boy, was referred to a paediatric allergy clinic for assessment of his peanut allergy. At 12 months of age he developed facial contact urticaria to peanut butter, which spontaneously resolved without respiratory or other symptoms. Since then, he has not had further reactions or eaten peanuts, although the rest of the family often eat peanuts and nuts. Dylan is regularly ca...

Journal

Journal title: Expert Systems With Applications

Year: 2022

ISSN: 1873-6793, 0957-4174

DOI: https://doi.org/10.1016/j.eswa.2022.117230